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Abstract.  Since  Robotics  is  the  field  concerned  with  the  connection  of  perception 
to  action,  Artificial  Intelligence  must  have  a  central  role  in  Robotics  if  the  connection 
is  to  be  intelligent.  Artificial  Intelligence  addresses  the  crucial  questions  of:  what 
knowledge  is  required  in  any  aspect  of  thinking;  how  that  knowledge  should  be 
represented;  and  how  that  knowledge  should  be  used.  Robotics  challenges  AI  by 
forcing  it  to  deal  with  real  objects  in  the  real  world.  Techniques  and  representations 
developed  for  purely  cognitive  problems,  often  in  toy  domains,  do  not  necessarily 
extend  to  meet  the  challenge.  Robots  combine  mechanical  effectors,  sensors,  and 
computers.  AI  has  made  significant  contributions  to  each  component.  We  review  AI 
contributions  to  perception  and  object  oriented  reasoning.  Object-oriented  reasoning 
includes  reasoning  about  space,  path-planning,  uncertainty,  and  compliance.  We 
conclude  with  three  examples  that  illustrate  the  kinds  of  reasoning  or  problem 
solving  abilities  we  would  like  to  endow  robots  with  and  that  we  believe  arc  worthy 
goals  of  both  Robotics  and  Artificial  Intelligence,  being  within  reach  of  both.- 
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I .  Robot  i('s;m<l  Arf  ilicinllril  olligenro 


Hobo!  ics  is  t  lw  i nil  llitjt  nt  toiiht  t  t ioit  i>f  fit  rt  i  ft!  ion  to  tit  '{mu.  "I  lit-  kt  \  wold-'  in 
tli.it  sciiiciKT  arc  "intelligent  and  cone omit  out  "perception'.  NormalK  Robotics 
is  thought  of  .is  simply  the  connection  of  sensing  to  m  l  ion  using  computers.  The 
typical  sensing  modalities  of  current  robots  include  vision,  force  ;ind  tactile  sensing, 
as  well  as  proprioceptive  sensing  of  the  robot's  internal  state.  The  capa.citv  for 
action  is  provided  by  arms,  grippers,  wheels,  and,  occasionally,  legs. 


The  software  of  modern,  commercially  available,  robot  systems  such  as  t  he  IBM 
Tafia  [Taylor,  Summers,  end  Meyer  19*2],  the  l  nimation  IT  M  A  !  \  a  I  l‘(S(),  Shimano 
et.  al.  I'hS-lj,  and  the  Automatix  cvbervision  [Franklin  and  \anderBrug  BIS'J.  \  illers 
19X2]  includes  a  wide  variety  of  functions:  it  performs  trajectory  calculation  and 
kinematic  translation,  interprets  sense  data,  executes  adaptive  control  through 
conditional  execution  and  real  time  monitors,  interfaces  to  databases  of  geometric 
models,  and  supports  program  development.  It  does  some  of  these  tasks  quite  well, 
particularly  those  that  pertain  to  Computer  Science;  it  does  others  quite  poorly, 
particularly  perception,  object  modelling,  and  spatial  reasoning. 

The  intelligent  connection  of  perception  to  action  replaces  sensing  by  perception, 
and  software  by  intelligent  software.  Perception  differs  from  sensing  or  classification 
in  that  it  implies  the  construction  of  representations  that  are  the  basis  for 
recognition,  reasoning  and  action.  Intelligent  software  addresses  issues  such  as: 
spatial  reasoning,  dealing  with  uncertainty,  geometric  reasoning,  compliance,  and 
learning.  Intelligence,  including  the  ability  to  reason  and  learn  about  objects  and 
manufacturing  processes,  holds  the  key  to  more  versatile  robots. 


Insofar  as  Robotics  is  the  intelligent  connection  of  perception  to  action,  Artificial 
Intelligence  (AI)  is  the  challenge  for  Robotics.  On  the  other  hand,  however,  we  shall 
argue  that  Robotics  severely  challenges  Artificial  Intelligence  (AI)  by  forcing  it  to 
deal  with  real  objects  in  the  real  world.  Techniques  and  representations  developed 
for  purely  cognitive  problems  often  do  not  extend  to  meet  the  challenge. 


First,  we  discuss  the  need  for  intelligent  robots  and  we  show  why  Robotics 
poses  severe  challenges  for  Artificial  Intelligence.  Then  we  consider  what  is  required 
for  robots  to  act  on  their  environment.  This  is  the  domain  of  kinematics  and 
dynamics,  control,  innovative  robot  arms,  multi-fingered  hands,  and  mobile  robots. 
In  section  5,  we  turn  attention  to  intelligent  software,  focussing  upon  spatial 
reasoning,  dealing  with  uncertainty,  geometric  reasoning,  and  learning.  In  section 
6,  we  discuss  robot  perception.  Finally,  in  Section  7,  we  present  sortie  examples  of 
reasoning  that  connects  perception  to  action,  example  reasoning  that  no  robot  is 
currently  capable  of.  We  include  it  because  it  illustrates  the  reasoning  and  problem 
solving  abilities  we  would  like  to  endow  robots  with  and  that  we  believe  are  worthy 
goals  of  Robotics  and  Artificial  Intelligence,  being  within  reach  of  both. 
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2.  The-  not'd  for  int  rlligcii  robots 


Where  is  the  need  lor  intelligent  robot'-'1  (urrmt  i  uniniel!i:;ent  robots  work  line  so 
long  ,i>  !  lies  ;ire  ;t|>(>lied  to  si  in  pie  t  .I'-ks  in  .1 1  most  predie  1 . 1  b  l<  Ml  n.i  1  ions  pa  rt  s  oi  I  lie 
cor  reel  type  are  present  ed  in  posit  ions  and  orient  at  ions  t  bat  bardic  van  .  and  lit  t  le 
dexterity  is  required  lor  surtc-slul  completion  ol  the  task  The  huge  lommenial 
successes  of  robot  automation  have  been  ol  this  sort:  parts  traits!,  r  (including 
palletizing  and  packaging),  spot  welding,  and  spray  painting.  Automation  has  been 
aimed  largely  at  highly  repetitive  processes  such  as  these  in  major  industrial  plants. 

Hut  to  control  the  robot  s  environment  suflieient Iv.  it  is  typically  necessary  to  erect 
elaborate'  fixtures.  Often,  the  set-up  costs  associated  with  designing  and  installing 
fixtures  and  jigs  dominate  the  cost  ol  a  robot  application.  Worse,  elaborate 
fixturing  is  often  not  transferrable  to  a  subsequent  task,  reducing  the  flexibility 
and  adaptability  that  are  supposedly  the  key  advantages  of  robots.  Sensing  is 
one  way  to  loosen  up  the  environmental  requirements;  but  the  sensing  systems  of 
current  industrial  robots  are  mostly  restricted  to  two-dimensional  binary  vision. 
Industrial  applications  requiring  compliance,  such  as  assembly,  seam  welding  and 
surface  finishing,  have  clearly  revealed  the  inabilities  of  current  robots.  Research 
prototypes  have  explored  the  use  of  three-dimensional  vision,  force  and  proximity 
sensors,  and  geometric  models  of  objects  [Paugcras  1982,  Clorksin  et.  al.  1982, 
Trevelyan,  Kovesi,  and  Ong  1984,  Nakagawaand  Ninomiya  1984,  Porter  and  Mundy 
1982,  1984].  Other  applications  expose  the  limitations  of  robots  even  more  clearly. 
The  environment  cannot  be  controlled  for  most  military  applications,  including 
smart  sentries,  autonomous  ordinance  disposal,  autonomous  recovery  of  men  and 
materiel,  and,  perhaps  most  difficult  of  all,  autonomous  navigation. 


3.  Roboticspart of  Artificiallntelligence 

Artificial  Intelligence  (AI)  is  the  field  that  aims  to  understand  how  computers 
can  be  made  to  exhibit  intelligence.  In  any  aspect  of  thinking,  whether  reasoning, 
perception,  or  action,  the  crucial  questions  are: 

•  What  knowledge  is  needed.  The  knowledge  needed  for  reasoning  in  relatively 
formalized  and  circumscribed  domains  such  as  symbolic  mathematics  and  game 
playing  is  well  known.  Highly  competent  programs  have  been  developed  in  such 
domains.  It  has  proven  remarkably  difficult  to  get  experts  to  precisely  articulate 
their  knowledge,  and  hence  to  develop  programs  with  similar  expertise,  in  medicine, 
evaluating  prospective  mining  sites,  or  configuring  computers  (see  [Mich ie  1979, 
Hayes- Roth,  Waterman,  and  Lenat  1983,  Winston  19S3]  for  a  discussion  of  expert 
systems,  and  accounts  of  the  difficulty  of  teasing  knowledge  out  of  experts).  Among 
the  many  severe  inadequacies  of  the  current  crop  of  expert  systems,  is  the  fact  that 
they  usually  have  limited  contact  with  the  real  world.  Human  experts  perform  the 
necessary  perceptual  preprocessing,  telling  MYCl.Y  for  example  that  the  patient  is 
‘febrile  0.8’.  Moving  from  the  restricted  domain  of  the  expert.,  to  the  unrestricted 
world  of  everyday  experience,  determining  what  knowledge  is  needed  is  a  major 


step  tow.ird  mmlrllmf?  sicrro  \  i>ion  .  ;n  li n  \  in j?  hi |>rtl  \\ .ilk  mjt  .iml  d \  luin  ir  Im  hi in r. 
anil  re.i>oning  ;i  lion  t  uni  'haniMii-  .mil  >  ( > . » t  *  ■ .  W  tint  do  von  need  in  know  in  older 
In  tat t  h  a  ball’’ 

•  Iteprrm  tiling  hniut  ledi/t .  A  kt- >  ctml  ribut ion  of  Al  is  the  obx  i  \ ;it  ion  that 
know  ledge  should  be  represent'd  explicitly,  not  ltt-a \  i I \  encoded,  lor  c\;mi|>lc 
numerically .  in  ways  that  suppress  structure  and  constraint.  A  given  body  of 
knowledge  is  used  in  many  ways  in  thinking.  Conventional  data  structures  are 
timed  to  a  single  set  of  processes  lor  access  and  modification,  and  this  renders 
them  too  inflexible  lor  use  in  thinking.  Al  has  developed  a  set  of  techniques  such  as 
semantic  networks,  frames,  and  production  rules,  that  are  symbolic,  highly  flexible 
encodings  of  knowledge,  yet  which  can  be  efficiently  processed. 

Robotics  needs  to  deal  with  the  real  world,  and  to  do  this  it  needs  detailed 
geometric  models.  Perception  systems  need  to  produce  geometric  models;  reasoning 
systems  must  base  their  deliberations  on  such  models;  and  action  systems  need 
to  interpret  them.  Computer-aided  design  (CAD)  has  been  concerned  with  highly 
restricted  uses  of  geometric  information,  typically  display  and  numerically-controlled 
cutting.  Representations  incorporated  into  current  CAD  svteins  are  analogous  to 
conventional  data  structures.  In  order  to  connect  perception,  through  reasoning, 
to  action,  richer  representations  of  geometry  are  needed.  Steps  toward  such  richer 
representations  can  be  found  in  configuration  space  [Lozano-Perez  1981,  1983a], 
generalized  cones  [Uinford  1981],  and  visual  shape  representations  [Horn  1982, 
Ikeuchi  1983,  Rrady  1984,  Marr  1982], 

As  well  as  geometry,  Robotics  needs  to  represent  forces,  causation,  and 
uncertainty.  We  know  how  much  force  to  apply  to  an  object  in  an  assembly  to  mate 
parts  without  wedging  or  jamming  [Whitney  1983].  We  know  that  pushing  too  hard 
on  a  surface  can  damage  it;  but  that  not  pushing  hard  enough  can  be  ineffective 
for  scribing,  polishing,  or  fettling.  In  certain  circumstances,  we  understand  how 
an  object  will  move  if  we  push  it  [Mason  1983],  We  know  that  the  magnitude  and 
direction  of  an  applied  force  can  be  changed  by  pulleys,  gears,  levers,  and  cams. 

We  understand  the  way  things  such  as  zip  fasteners,  pencil  sharpeners,  and 
automobile  engines  work.  The  spring  in  a  watch  stores  energy,  which  is  released  to 
a  flywheel,  causing  it  to  rotate;  this  causes  the  hands  of  the  watch  to  rotate  by  a 
smaller  amount  determined  by  the  ratios  of  the  gear  linkages.  Representing  such 
knowledge  is  not  simply  a  matter  of  developing  the  appropriate  mathematical  laws. 
Differential  equations,  for  example,  are  a  representation  of  knowledge  that,  while 
extremely  useful,  are  still  highly  limited.  Forbus  [1983]  points  out  that  conventional 
mathematical  representations  do  not  encourage  qualitative  reasoning,  instead,  they 
invite  numerical  simulation.  Though  useful,  this  falls  far  short  of  the  qualitative 
reasoning  that  people  are  good  at.  Artificial  Intelligence  research  on  qualitative 
reasoning  and  naive  physics  has  made  a  promising  start  but  has  yet  to  make  contact 
with  the  real  world,  so  the  representations  and  reasoning  processes  it  suggests  have 
barely  been  tested  [Hobbs  and  Moore  1984,  Forbus  198.3,  Al  Journal  1984,  DcKlecr 
1975,  Winston,  Uinford,  Katz,  and  l.owry  1984], 


Robot  ir-  need-  In  irpri'-tnl  u  in  crl  .1 1 11 1  v  .  so  i  li.it  i «  .  -  <  1 1 1 ;  1 1  ■  ■  i.iii  - .11  <  <  - -I  i, II v 
overcome  il .  There  .ire  bounds  oil  I  lie  .icc  11  r.o  v  ol  1  ohot  ion. i  -  I enlei  '  ml  11  ors 
li.iv  (>  errors;  and  I  lion;;  li  we  1 .1  Ik  .1 1  tout  repel  il  1  v  e  work,  no  1  w  o  p.i  ri  --  ,ri  1  v  or  c\,n  1 1  y 
alike.  As  I  lit-  tolerances  on  robot  applications  become  Imliicr.  the  need  in  deal  with 
uncertainly.  and  to  exploit  redundancy.  become",  area  ter. 

•  knoirit  ilyt.  A I  has  also  uncovered  techniques  lor  iisIIil’  knowledge 

elleet  ivel  y.  One  problem  is  that  t  he  knowledge  needed  in  ativ  part  it  u  1  1  r  c.im- cannot 
be  predicted  in  advance.  Programs  have  to  respond  llexiblv  to  .1  non  di  terministie 
world  Among  the  techniques  ollercd  by  Al  are  search,  -trmtnre  matching, 
constraint  proposal  ion.  and  dependency-directed  reasoning.  One  approach  to 
const r.  tit  proposition  is  being  developed  in  models  ol  pctception  by  Ter/opoulos 
I9S3  .  /acker,  Hummel,  and  Hosenfeld  [1977].  Another  has  been  developed  by 
Mrooks  jlOM.  I9S2-  building  on  earlier  work  in  theorem  proving.  The  application  of 
search  to  Robotics  lias  been  developed  by  Goto  et .  al.  il!)S(),.  Lozano- 1’eroz  [1981], 
Gaston  and  Lozano-Perez  [1982].  Grimson  and  i.ozano-Perez  !|9.vtj.  and  Brooks 
[1983b].  Structure  matching  in  Robotics  has  been  developed  by  Winston.  Binford, 
Katz,  and  Lowry  [1984]. 

To  be  intelligent,  Robotics  programs  need  to  be  able  to  plan  actions  and  reason 
about  those  plans.  Surely  Al  has  developed  the  required  planning  technology? 
Unfortunately,  it  seems  that  most,  if  not  all,  current  proposals  for  planning  and 
reasoning  developed  in  Al  require  significant  extension  before  they  ran  begin  to 
tackle  the  problems  that  typically  arise  in  Robotics,  some  of  which  are  discussed  in 
Section  5.  One  reason  for  this  is  that  reasoning  and  planning  has  been  developed 
largely  in  conjunction  with  purely  cognitive  representations,  and  these  have  mostly 
been  abstract  and  idealized.  Proposals  for  knowledge  representation  have  rarely  been 
constrained  by  the  need  to  support  actions  by  a  notoriously  inexact  manipulator,  or 
to  be  produced  by  a  perceptual  system  with  no  human  preprocessing.  ACRONYM 
[Brooks  1981,  Brooks  and  Binford  1980]  is  an  exeption  to  this  criticism.  Another 
reason  is  that  to  be  useful  for  Robotics,  a  representation  must  be  able  to  deal  with 
the  vagaries  of  the  real  world,  its  geometry,  inexactness,  and  noise.  All  too  often, 
Al  planning  and  reasoning  systems  have  only  been  exercised  on  a  handful  of  toy 
examples. 

In  summary,  Robotics  challenges  Al  by  forcing  it  to  deal  with  real  objects 
in  the  real  world.  Techniques  and  representations  developed  for  purely  cognitive 
problems,  often  in  toy  domains,  do  not  necessarily  extend  to  meet  the  challenge. 


4.  Action 


!n  this  section,  we  consider  what  is  required  for  robots  to  art  on  their 
environment.  I  bis  is  the  subject  of  kinematics  and  dynamics,  control,  robot  arms, 
multi-fingered  hands,  and  locomoting  robots. 


\ 


1.1.  kinematics,  l)y namics,  and  Arm  Design 

Tin-  kirn1  in  a  l  ics  of  rolio  I  arms  is  one  of  tin'  lie  tier  understood  aie.is  of  Robot  ii  - 
[Paul  HIM.  Brady  el .  al.  I983i.  I’  it  c  need  lur  kinemalii  t  r  a  1 1  s  I  o  r  m  a  I  ions  arises 
because  programmers  prefer  a  dillerent  representation  ol  tlie  span'  of  eonlij'.ural  ions 
of  a  robot  titan  that  which  is  most  natural  and  ellicienl  for  control.  Robots  are 
powered  by  motors  at  tiie  joints  between  links.  Associated  with  a  motor  are 
quantities  that  define  its  position,  velocity,  acceleration,  and  torque.  For  a  rotary 
motor,  these  are  angular  positions,  angular  velocities  etc.  It  is  most  efficient  to 
control  robots  in  joint  spare.  However,  programmers  prefer  to  think  of  positions 
using  orthogonal,  cylindrical,  or  spherical  Cartesian  coordinate  frames,  according 
to  tiie  task.  Six  degrees  of  freedom  (DOF)  are  required  to  define  the  position  and 
orientation  of  an  object  iti  space.  Correspondingly,  many  robots  have  six  joint 
motors  to  achieve  these  freedoms.  Converting  between  the  joint  posit  ions,  veloeit  ies, 
and  accelerations  and  tiie  Cartesian  (task)  counterparts  is  tiie  job  of  kinematics. 
The  conversion  is  an  identity  transformation  between  the  joint  space  of  “Cartesian" 
arms  (such  as  the  IBM  7565)  and  orthogonal  (x,y,z)  Cartesian  space.  Cartesian 
arms  suffer  the  disadvantage  of  being  less  able  to  reach  around  and  into  objects. 
Kinematic  transformations  are  still  needed  to  spherical  or  cylindrical  Cartesian 
coordinates. 

The  kinematics  of  a  mechanical  device  are  defined  mathematically.  The 
requirement  that  the  kinematics  can  be  efficiently  computed  adds  constraint, 
that  ultimately  affects  mechanical  design.  In  general,  the  transformation  from  joint 
coordinates  to  Cartesian  coordinates  is  straightforward.  Various  efficient  algorithms 
have  been  developed,  including  recent  recursive  schemes  whose  time  complexity  is 
linear  in  the  number  of  joints.  Hollcrbach  [1983]  discusses  such  recursive  methods  for 
computing  the  kinematics  for  both  the  Lagrange  and  Newton-Fulcr  formulations 
of  the  dynamics.  The  inverse  kinematics  computation,  from  Cartesian  to  joint 
coordinates,  is  often  more  complex.  In  general,  it  does  not  have  a  closed  form 
solution  [Pieper  1968].  Pieper  [1968,  see  also  Pieper  and  Roth  1969]  showed  that 
a  “spherical”  wrist  with  three  intersecting  axes  of  rotation  leads  to  an  exact 
analytic  solution  to  the  inverse  kinematic  equations.  The  spherical  wrist  allows  a 
decomposition  of  the  typical  six  degree  of  freedom  inverse  kinematics  into  two  three 
degree  of  freedom  computations,  one  to  compute  the  position  of  the  wrist,  the  other 
to  compute  the  orientation  of  the  hand.  More  recently,  Paul  [1981],  Paul,  Stevenson, 
and  Rcnaud  [1984],  Featherstone  [1983],  and  Hollcrbach  and  Sahar  [1983],  have 
developed  efficient  techniques  for  computing  the  inverse  kinematics  for  spherical 
wrists.  Orin  and  Schrader  [1984]  have  investigated  algorithms  for  computing  the 
Jacobian  of  the  kinematic  transformation  that  arc  suited  to  VLSI  implementation. 

If  the  number  of  robot  joints  is  equal  to  six,  there  are  singularities  in  the 
kinematics,  that  is,  a  small  change  in  Cartesian  configuration  corresponds  to  a  large 
change  in  joint  configuration.  The  singularities  of  six  degree-of-freedom  industrial 
robot  arms  arc  well  cataloged.  Singularities  can  be  avoided  by  increasing  the 
number  n  of  joints,  but  then  there  arc  infinitely  many  solutions  to  the  inverse 
kinematics  computation.  One  approach  is  to  use  a  generalized  inverse  technique 
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Mo-t  attention  ha-  centered  on  open  kinematic  chain-  -nth  as  rolmt  arm-. 
Much  le—  work  has  been  done  on  closed  kinematic  chain-  -in  h  a-  I e •  e. e < !  lobot-  or 
multi  lingered  liands.  Iliro-c  et .  al.  10''!  have  dc-igncd  a  pantograph  mechanism 
lor  a  t|uadruped  robot  that  significantly  reduces  potential  energy  lo--  in  walking. 
Sali-bury  and  C  raig  ]I982  (see  also  Salisbury  19S2|i  have  used  a  number  ol 
computational  constraints,  including  mobility  and  optimization  ol  linger  placement, 
to  design  a  three-fingered  hand.  The  accuracy  and  dexterity  ol  a  robot  varies  with 
configuration,  so  attention  needs  to  be  paid  to  the  layout  of  the  workspace. 
Salisbury  and  Craig  [19X11  used  the  condition  number  of  the  Jacobian  matrix  (using 
the  row  norm)  to  evaluate  configurations  of  the  hand,  that  is  to  evaluate  points 
in  the  workspace.  Yoshikawa  [198-1]  has  introduced  a  measure  of  manipulability 
for  a  similar  purpose.  Roth  ,19811  reviews  the  application  to  Robotics  of  screw 
coordinates  to  link  kinematics  and  dynamics. 

The  dynamic  equations  of  a  robot  arm  (see  llolterbach  [1983])  consist  of 
n  coupled,  second-order,  differential  equations  in  the  positions,  velocities,  and 
accelerations  of  the  joint  variables.  The  equations  are  complex  because  they  involve 
terms  from  two  adjacent  joints,  corresponding  to  reaction  and  Coriolis  torques. 
Conventional  techniques  have  simplified  dynamics  by  dropping  or  linearizing 
terms,  or  have  proposed  table  look-up  techniques.  Recently,  "recursive"  recurrence 
formulations  of  the  dynamic  equations  have  been  developed  that: 

1.  Compute  the  kinematics  frotn  the  shoulder  to  the  hand  in  time  proportional 

to  n, 

2.  Compute  the  inverse  dynamics  from  the  force  and  torque  exerted  on  the  hand 

by  the  world  from  the  hand  to  the  shoulder,  again  in  time  proportional  to  n. 

J'lte  importance  of  this  result  is  threefold: 

•  I  irst,  it  suggests  that  a  more  accurate  inverse  plant  model  can  be  developed, 
leading  to  faster,  more  accurate  arms.  Friction  is  a  major  source  of  the  discrepancy 
between  model  and  real  world  Direct  drive  technology  [A-ada  arid  Kanade  1981, 
Asad. i  1982.  A-ada  and  Yousef -Ton mi  1981;  reduces  the  mismatch.  In  a  direct  drive 
arm.  a  motor  i-  directly  connected  to  a  joint  with  no  intervening  transmission 
elements,  such  as  gears,  chains,  or  ball  screws.  The  advantage-  are  that  friction 
and  backlash  are  low,  so  the  direct  drive  joint  is  backdrisable.  This  means  that  it 
can  be  controlled  using  torque  in-toad  of  po-ition.  Torque  control  is  important  for 
achieving  compliance,  and  for  feedforward  dynamics  compen-at ion . 
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•  Ilurd.  Ilollerhaeh  ;md  S;i Im r  Bl'vb  have  shown  1 11.1t  llicir  n  Iiiiimik  nl  ol 
leal  herstonc  s  tcchiiuiiic  lor  coin  pu  l  i  n*>  I  lie  inverse  kinematics  make-  avail,  title 
many  oh  the  Icrms  needed  lor  the  recursive  \cwion  1  niter  dynamics. 

Renand  |19V|'  has  developed  a  tnncl  iterative  l.agrnngiari  scheme  1 11.1t  1  <-i  pi  ices 
aliont  dot)  additions  and  dad  multiplies  lor  a  six  revolnte  joint  robot  arm.  The 
met  hod  lias  been  applied  to  manipulators  h a \  mg  a  General  I  ree  st  met  me  ol  revolnte 
and  prismat ic  joints.  Huston  and  Kelly  I9v_>  and  Kane  and  Levinson  L  have 
recently  adapted  Kanes  formulation  ol  dynamics  to  robot  structures. 

4.2.  Control 


Much  of  control  theory  has  developed  for  slowly  changing,  nearly  rig  '  stems. 
The  challenges  of  robot  control  are  several: 

•  Complex  dynamics.  The  dynamics  of  open-link  kinematic  chain  robots  consist 
of  n  coupled  second-order  partial  differential  equations,  where  n  is  the  number  of 
links.  They  become  even  more  complex  for  a  closed  multi-manipulator  system  such 
as  a  multi-fingered  robot  hand  or  locomoting  robot. 

•  Articulated  structure.  The  links  of  a  robot  arm  are  cascaded  and  the  dynamics 
and  inertias  depend  on  the  configuration. 

•  Discontinuous  change.  The  parameters  that  are  to  be  controlled  change 
discontinuously  when,  as  often  happens,  the  robot  picks  an  object  up. 

•  Range  of  motions.  To  a  first  approximation  one  can  identify  several  different 
kinds  of  robot  motion:  free  space  or  gross  motions,  between  places  where  work  is 
to  be  done;  approach  motions  (guarded  moves)  to  a  surface;  and  compliant  moves 
along  a  constraint  surface.  Each  of  these  different  kinds  of  motion  poses  different 
control  problems. 

The  majority  of  industrial  robot  controllers  are  open-loop.  However,  many  control 
designs  have  been  investigated  in  Robotics;  representative  samples  are  to  be 
found  in  [Brady  et.  al.  1983,  Paul  1981,  Brady  and  Paul  198-1] .  They  include 
optimal  controllers  [Kahn  and  Roth  1971,  Dubowsky  19S3);  model  reference  control 
[Dubowsky  and  des  Forges  1979];  sliding  inode  control  [Young  1978  (in  Brady  et.  al. 
1983)];  non-linear  control  [Freund  1981,  198  ij ;  hierarchical  control  [Salisbury  and 
Craig  1981  j:  distributed  control  [Klein  and  Wahawisan,  1982j;  hybrid  force- position 
control  [Raibert  and  Craig  1981  (reproduced  in  Brady  et.  al.  1983),  Klein  Olson, 
and  Pugh  1983];  and  integrated  system  control  [Albus  1983].  Cannon  and  Schmitz 
[1981]  have  investigated  the  precise  control  of  flexible  manipulators. 

4.3.  End  effectors 

Industrial  uses  of  robots  typically  involve  a  multi-purpose  robot  arm  and  an  end 
effector  that  is  specialized  to  a  particular  application.  End  effectors  normally 
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nu  ll  with  .l  - 1 1 1 o | c  |)()|  .  or  some  coiiitmi.it  ion  ()iir  profile  m  wph  u-ing  multiple 
single  D()l  grippers  i'  that  .i  large  percentage  ol  the  work  cycle  i>  spriii  <  !i:i  n<r,i  iifi, 
grippers.  This  tui >  inspired  rescan  h  on  I  tic  uin  ii.inn  .il  iii-i".n-  that  support  last 
gripper  change.  Anotlirr  prolilrm  is  that  the  apptoach  assume-  that  a  process  can 
he  divided  into  a  discrete  set  of  single  l)()F  operations. 

Multiple  arms  raise  the  problem  of  coord  in  at  ing  their  motion  while  avoiding  collision 
and  without  cluttering  the  workspace.  The  coordination  of  two  arms  was  illustrated 
at  the  Stanford  Artificial  Intelligence  Laboratory  in  1972  when  two  arms  combined 
to  install  a  hinge.  One  of  the  arms  performed  the  installation,  the  other  acted  as 
a  programmable  fixture,  presenting  the  hinges  and  other  parts  to  the  work  arm. 
Freund  [19S1.  has  presented  a  control  scheme  for  preventing  collisions  between  a 
pair  cooperating  robots.  Whenever  there  is  a  possibility  of  a  collision,  one  of  the 
arms  is  assigned  master  status  and  the  other  one  has  to  modify  its  trajectory  to 
avoid  the  master. 

In  contrast  with  such  end  effectors,  a  human  hand  has  a  remarkable  range  of 
functions.  The  fingers  can  be  considered  to  be  sensor  intensive  d  or  1  DOF  robot 
arms.  The  motions  of  the  individual  fingers  are  limited  to  curl  and  flex  motions 
in  a  plane  that  is  determined  by  the  abduction/ adduction  of  the  finger  about  the 
joint  with  the  palm.  The  motions  of  the  fingers  are  coordinated  by  the  palm,  which 
c  an  assume  a  broad  range  of  configurations.  The  dexterity  of  the  human  hand  has 
inspired  several  researchers  to  build  multi-func  tion  robot  hands. 

Okada  ■  1  described  a  hand  consisting  of  three  fingers  evenly  spaced  about  a 
planar  palm  The  work-pace  of  the  individual  fingers  was  an  ellipsoid.  The  three 
work-paces  intersected  in  a  point.  Okada  programmed  the1  hand  to  perform  several 
tasks  such  as  tighten  bolts.  Ilanaftisa  and  Asada  11977,  1979,  developed  a  hand 
consisting  of  three  evenK -spaced,  spring-loaded  fingers.  Hie  real  and  apparent 
spring  eon-tant-  of  the  lingers  were  under  program  control.  Stable  grasps  were 
defined  as  the  minima  of  a  potential  function.  The  definition  of  stability  in  two 
dimensions  wa  -  demonstrated  by  programming  the  hand  to  pick  up  an  arbitrary 
shaped  lamina  viewed  by  a  TV  camera. 

Salisbury  19x2,  see  also  Salisbury  and  Craig  19X2;  investigated  kinematic  and  force 
constraints  on  the  de-igu  of  a  tendon-driven  t  h  rec- fingered  hand  (see  Figure  1). 


Figure  1.  The  three- fingered  Robot  hand  developed  by  Salisbury  and  Craig  [l?)8lj.  Hach  finger 
has  three  degrees  of  freedom,  and  is  pulled  by  four  tendons.  The  hierarchical  controller  includes 
three  finger  controllers,  each  of  which  consists  of  four  I’ll)  controllers,  one  per  tendon.  Reproduced 
from  (Salisbury  15)82) 

The  goal  was  to  design  a  hand  that  could  impress  an  arbitrary  (vector)  force  in 
an  arbitrary  position  of  the  hand’s  workspace.  Four  of  the  parameters  defining  the 
placement  of  the  thumb  were  determined  by  solving  a  series  of  one-dimensional 
nonlinear  programming  problems.  A  hierarchical  controller  was  designed:  P1D 
controllers  for  each  tendon;  four  such  for  each  finger;  and  three  such  for  the  hand. 
To  date,  position  and  force  controllers  have  been  implemented  for  the  individual 
fingers.  A  force  sensing  palm  has  recently  been  developed  [Salisbury  1984].  It  can 
determine  certain  attributes  of  contact  geometries.  The  force  sensing  fingertips 
being  developed  for  the  hand  will  permit  accurate  sensing  of  contact  locations  and 
surface  orientations.  This  information  is  likely  to  be  useful  in  object  recognition 
strategies  and  in  improving  the  sensing  and  control  of  contact  forces. 

The  Center  for  Biomedical  Design  at  the  University  of  Utah  and  the  MIT  Artificial 
Intelligence  Laboratory  are  developing  a  tendon-operated,  multiple  DOF  robot  hand 
with  multi-channel  touch  sensing.  The  hand  that  is  currently  being  built  consists 
of  three  4  DOF  lingers,  a  4  DOF  thumb,  and  a  3  DOF  wrist  (total  19  DOF).  Three 
fingers  suffice  for  static  stable  grasp.  The  Utah-MIT  design  incorporated  a  fourth 
finger  to  minimize  reliance  on  friction  and  to  increase  flexibility  in  grasping  tasks. 
The  hand  features  novel  tendon  material  and  tendon  routing  geometry  (Figure  2). 
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Figure  2.  The  prototype  Utah/MIT  dextrous  hand  developed  by  Steven  Jarobsen,  John  Wood, 
and  John  llollerbarh.  The  four  fingers  cacii  have  four  degrees  of  freedom,  a.  The  geometry  of 
tendon  routing,  b.  The  material  composition  of  tendons.  (Reproduced  from  (Jacohsen  ct.  al  1984, 
Figures  2b  and  7]) 

4.4.  Mobile  robots 

Certain  tasks  are  difficult  or  impossible  to  perform  in  the  workspace  of  a  static 
robot  arm  [Giralt  1984].  In  large  scale  assembly  industries,  such  as  shipyards  or 
automobile  assembly  lines,  it  is  common  to  find  parts  being  transferred  along 
gantries  that  consist  of  one  or  more  degrees  of  linear  freedom.  Correspondingly, 
there  have  been  several  instances  of  robot  arms  being  mounted  on  rails  to  extend 
their  workspace.  The  rail  physically  constrains  the  motion  of  the  robot.  More 
generally,  the  robot  can  be  programmed  to  follow  a  path  by  locally  sensing  it. 
Magnetic  strips,  and  black  strips  sensed  by  infra-red  sensing  linear  arrays,  have 
been  used,  for  example  in  the  Fiat  plant  in  Turin,  Italy. 

More  ambitious  projects  have  used  (non-local)  vision  and  range  data  for  autonomous 
navigation.  Space  and  military  applications  require  considerable  autonomy  for 
navigation,  planning,  and  perception.  Mobile  robots  arc  complex  systems  that 
incorporate  perceptual,  navigation,  and  planning  subsystems.  Shakey  [Nilsson 
1909]  was  one  of  the  earliest  mobile  robots,  and  certainly  one  of  the  most  ambitious 
system  integration  efforts  of  its  time.  Later  work  on  mobile  robots  includes  [Dobrotin 
and  Lewis  1975,  Giralt,  Sobck,  and  Chatila  1977,  Giralt,  Chatila,  and  Vaissct  1984, 
Lewis  and  Johnson  1977,  Lewis  and  Dejczy  1973,  Harmon  1983a,  1983b,  Everett 
1982,  Moravec  1981,  1983,  1984]. 
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Figure  3.  I  lie  j*ix  legged  robot  dev (  loped  by  Mrl.bce.  Oriit.  Klein,  .eid  their  roll-  .igues  ;it 
the  Ohio  State  ('Diversity  The  robot  mo  eitlar  an  alternating  tripod  of  -upport  or  wave  gait 
(Keprodured  from  Ozgwncr,  Tsai,  and  Mediae  ItlHl.  f  igure  I  ) 


All  the  robots  referred  to  previously  in  this  section  are  wheeled.  This  restricts 
their  movement  to  (nearly)  even  terrain.  Legged  vehicles  can  potentially  escape 
that  limitation.  The  price  to  be  paid  for  this  advantage  is  the  extra  complexity  of 
maintaining  balance  and  controlling  motion,  following  the  photographic  studies  of 
Muybridge  and  Moayer  irt  the  late  19th  century,  a  theory  of  locomotion  developed 
around  the  concept  of  gait,  the  pattern  of  foot  placements  and  foot  support  duty 
cycles.  The  OSL  hexapod,  for  example,  has  been  programmed  to  use  either  an 
alternating  t r i pod  of  support  or  wave  gait,  in  which  a  left  right  pam  of  legs  is 
lifted  and  advanced  (f  igure  ()/guner.  Tsai,  and  M<  (Jhee  19M  Ifirom  >  I9M, 
quadruped  robot  (figure  li  emplovs  a  crab  trait,  m  which  t  tie  direct  ton  of  motion 
is  at  an  angle  to  the  direction  the  quadruped  is  facing. 

Two  generic  problems  in  legged  locomotion  are  moving  ovr  uneven  terrain  and 
achieving  dynamic  balance  without  a  statu  pyramid  of  support 

The  simplest  walking  machines  employ  a  predetermined .  t:\ed  gait.  1  rteven  ti  rrair; 
requires  dvriamn  determination  ol  foot  pl.teement .  impl.  uu  variable  gait  lliro-a 


Ariiim.i!  Inlrlligrnrr  ami  Kobolir* 


Figure  4.  The  quadruped  robot  built  by  Ilirosr,  Umetani,  and  colleagues  at  Tokyo  Institute  of 
Technology.  The  robol  can  walk  upstairs,  and  can  move  forward  in  a  crab  gait.  (Reproduced 
from  [llirose  r.t.  at.  198-1,  photo  1]) 


[1984]  and  Ozguncr,  Tsai,  and  McGhee  [1984]  analyze  the  constraint  of  balance, 
and  use  sensors  in  their  choice  of  foot  placement. 

Miura  and  Shimoyama  [1984]  and  Raibert  et.  al.  [1982,  1984]  discuss  the  dynamic 
requirements  of  balance.  Miura  and  Shimoyama  reports  a  scries  of  biped  walking 
machines.  The  first  of  these,  BIPER-3  (Figure  5)  has  stilt-like  legs,  with  no  ankle 
joint,  and  resembles  a  novice  nordic  skier  in  its  gait.  BIPER-3  falls  if  both  feet 
keep  contact  with  the  surface;  so  it  must  continue  to  step  if  it  is  to  maintain 
its  balance.  An  ambitious  development,  BIPER-4,  shown  in  Figure  6,  has  knee 
and  ankle  joints.  Stable  walking  of  BIPER-4  has  recently  been  demonstrated. 
Raibert  considers  balance  for  a  three-dimensional  hopping  machine  (Figure  7).  He 
suggests  that  balance  can  be  achieved  by  a  planar  (two-dimensional)  controller  plus 
extra-planar  compensation.  His  work  suggests  that  gait  may  not  be  as  central  to 
the  theory  of  locomotion  as  has  been  supposed.  Instead,  it  may  be  a  side-effect  of 
achieving  balance  with  coupled  oscillatory  systems.  Raibert  [1984]  has  organized  a 
collection  of  papers  on  legged  robots  that  is  representative  of  the  state  of  the  art. 

5.  Reasoningaboutobjectsand  space 


5.1.  Task-level  robot  programming  languages 

Earlier,  we  listed  some  of  the  software  features  of  modern,  commercially  available 
robot  systems:  they  perform  trajectory  calculation  and  kinematic  translation, 
interpret  sense  data,  execute  adaptive  control  through  conditional  execution  and 
real  time  monitors,  interface  to  databases  of  geometric  models,  and  support  program 
development.  Despite  these  features,  robot  programming  is  tedious,  mostly  because 


systems  in  which  the  user  writes  a  computer  program  specifying  motion  and 
sensing,  and  task-level  programming  systems  in  which  the  user  specifies  operations 
by  their  desired  effect  on  objects.”  Languages  such  as  VAL  II  [Shimano,  Gcschke, 
and  Spaulding  1984]  and  AML  [Taylor,  Summers,  and  Meyer  1982]  are  considered 
robot- level. 

One  of  the  earliest  task-level  robot  programming  language  designs  was 
AUTOPASS  [Liebermann  and  Wesley  1977].  The  (unfinished)  implementation 
focussed  upon  planning  collision-free  paths  among  polyhedral  objects.  The  emphasis 
of  RAPT  [Ambler  and  Popplcstone  1975,  Popplestone,  Ambler,  and  Bellos  1980] 
has  been  on  the  specification  of  geometric  goals  and  relational  descriptions  of 
objects.  The  implementation  of  RAPT  is  based  upon  equation  solving  and  constraint 
propagation.  Other  approximations  to  task-level  languages  include  PADL  [Rcquicha 
1980],  IBMsolid  [Wesley  et.  al.  1980],  and  LAMA  [Lozano-Perez  1976].  Lozano-Perez 
[1983c]  discusses  spatial  reasoning  and  presents  an  example  of  the  use  of  RAPT. 

In  the  next  section  we  discuss  Brooks’  work  on  uncertainty,  several  approaches 
to  reasoning  about  space  and  avoiding  obstacles,  and  synthesizing  compliant 

programs. 
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Dealing  with  uncert ainty 

Consider  the  problem  i  1 1  list  ra  t  <d  in  Figure  S.  A  robot  li;is  been  pro*.*  i  :i  in  m<-<i  lo 
(Mil  .1  .screw  in  a  hole.  Will  tfir  program  succeed'’  liacli  of  ilie  join!  me.Min  mrnls 
of  the  robot  is  subject  to  small  errors,  which  produce  errors  in  the  position  and 
orientation  ol'  the  finder  tips  according  to  the  Jacobian  of  the  kinematics  lunction 
I'he  position  and  orientation  of  the  screwdriver  in  the  lingers  is  subject  to  slight 
error,  as  is  the  screw,  box,  and  the  lid  on  the  box.  These  errors,  we  will  call  them 
the  bat sc  errors,  are  independent  of  the  particular  task  to  be  performed.  They  add 
up.  Taylor  !l97(>i  assumed  particular  numerical  bounds  lor  the  values  of  the  base 
errors,  and  used  linear  programming  to  bound  the  error  in  the  placement  of  the 
screw  relative  to  the  hole. 

Brooks  1 1  9S2]  worked  with  explicit  symbolic  (trigonometric)  expressions  that 
define  the  error  in  the  placement  of  the  screw  relative  to  the  hole.  He  applied  the 
expression  bounding  program  developed  for  the  ACRONYM  project  (Brooks  1 9<S  1  ] 
to  the  base  error  bounds  used  by  Taylor  to  deduce  bounds  for  the  errors  in  the 
placement  of  the  screw  relative  to  the  hole,  'i'he  bounds  he  obtained  were  not  as 
tight  as  those  obtained  by  Taylor,  but  were  nearly  so. 


Brooks’  approach  had  a  substantial  advantage  over  Taylor,  however,  and  it  is 
paradigmatic  of  the  AI  approach.  The  expression  bounding  program  ran  be  applied 
with  equal  facility  to  the  symbolic  expression  for  the  error  and  the  desired  size  of 
the  screw  hole  (the  specifications  of  the  insertion  task).  The  result  is  a  bound  on  the 
only  free  variable  of  the  problem,  the  length  of  the  screwdriver.  The  lesson  is  that 
it  is  possible  to  apply  AI  techniques  to  reason  in  the  face  of  uncertainty.  In  further 
work,  Brooks  [1982]  has  shown  how  sensing  might  be  modeled  using  uncertainties 
to  automatically  determine  when  to  splice  a  sensing  step  into  a  plan  to  cause  it  to 
succeed. 


5.3.  Reasoning  about  space  and  avoiding  objects 

Robot-level  programming  languages  require  the  programmer  to  state,  for  example, 
that  the  robot  is  to  move  the  block  U,  whose  configuration  (position  and  orientation) 
Ry  is  to  be  moved  to  the  configuration  Re;.  To  ensure  the  robot  does  not  crash  into 
obstacles,  the  usual  practice  in  robot  level  languages  is  to  specify  a  sufficient  number 
of  via  points  (Figure  9)  (sec  [Brady  1983c]).  In  a  task  oriented  language,  one  merely 
says  something  like  “put  B  in  the  vise”.  It  follows  that  a  crucial  component  of 
implementing  a  task  oriented  programming  language  is  automatically  determining 
safe  paths  between  configurations  in  the  presence  of  obstacles.  This  turns  out  to 
be  an  extremely  hard  problem. 

Lozano-Pcrcz  [1983a]  introduced  a  representation  called  C-space  that  consists 
of  the  safe  configurations  of  a  moving  object.  P'or  an  single  object  moving  with  6 
degrees  of  freedom  (eg.,  3  translational  and  3  rotational  degrees  of  freedom),  the 
dimensionality  of  the  C-space  is  six.  If  there  are  m  such  objects,  each  of  which 
can  move,  the  dimensionality  of  C-space  is  (i m.  P’or  example,  for  the  coordinated 
motion  of  two  3D  objects,  C-space  is  twelve  dimensional.  In  practice  one  can  deal 
with  “slices”,  projections  onto  lower  dimensional  subspaces. 
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Figure  8.  Will  the  screw  make  it  into  ( lie  hole? 

Donald  [198-1]  notes  that  there  arc  two  components  of  spatial  planning  systems. 
First,  it  is  necessary  to  represent  the  problem,  in  particular  the  obstacles.  Second,  it 
is  necessary  to  devise  an  algorithm  that  can  search  for  paths  over  the  representation. 
Most  work  on  spatial  reasoning  has  used  representations  that  approximate  the 
exact  polyhedral  obstacles.  Such  representations  may  (1)  restrict  the  degrees  of 
freedom  in  a  problem,  (2)  bound  objects  in  real-space  by  simple  objects  such  as 
spheres,  or  prisms  with  parallel  axes,  while  considering  some  subset  of  the  available 
degrees  of  freedom,  (3)  quantize  configuration  space  at  certain  orientations,  or  (4) 
approximate  swept  volumes  for  objects  over  a  range  of  orientations.  Systems  that 
use  such  representations  may  not  be  capable  of  finding  solutions  in  some  cases, 
even  if  they  use  a  complete  search  procedure.  An  approximation  of  the  obstacle 
environment,  robot  model,  or  C-spacc  obstacles  can  result  in  a  transformed  find-path 
problem  which  has  no  solution. 

Lozano-l’erez  [1981,  1983]  implemented  an  approximate  algorithm  for  Cartesian 
manipulators  (for  which  free  space  and  C-spacc  are  the  same)  that,  tesselated  free 
space  into  rec.tangloids,  subdividing  it  as  far  as  necessary  to  solve  a  given  problem. 
I'he  search  algorithm  is  complete  for  translations,  and  illustrates  the  feasibility  of 


Start,  P0 


(a) 


Start 


•5*  -  ••  •'  • '  WtT" 

.,  *•  •-•v  .» 


Cool 


.  tr-= — ■  vm 

r»V--  29-' 


(b) 


Figure  fl.  Point  / 1  >  /  2 >  tuid  1 3  arc  specified  as  via  points  to  coax  the  robot,  throng})  the  narrow 
gap  separating  the  two  obstacles  shown.  (Ueproducod  from  (Brady  1983r,  Figure  2a]) 

the  C-spacc  approach.  It  works  by  alternately  keeping  the  heading  of  an  object 
fixed  and  rotating  in  place  to  alter  the  heading.  Recently,  Brooks  and  Lozano-Perez 
[1983]  reported  an  algorithm  capable  of  moving  a  reorientable  polygon  through 
two-dimensional  space  littered  with  polygons.  This  algorithm  can  Gnd  any  path  of 
interest  for  the  two-dimensional  problem.  Figure  10  shows  an  example  path  found 
by  Brooks  and  Lozano-Perez’s  program.  Their  attempts  to  extend  the  method  to 
three  dimensions  “were  frustrated  by  the  increased  complexity  for  three  dimensional 
rotations  relative  to  that  of  rotations  in  two  dimensions”  [Brooks  1983b,  p7). 

Brooks  [1983a]  suggested  that  free  space  be  represented  by  overlapping 
generalized  cones  that  correspond  to  freeways  or  channels.  Figure  11  shows  some 
of  the  generalized  cones  generated  by  two  obstacles  and  the  boundary  of  the 
workspace.  The  key  point  about  the  representation  was  that  the  left  and  right 
radius  functions  defining  a  freeway  could  be  inverted  easily.  Given  a  freeway, 
and  the  radius  function  of  a  moving  convex  object,  he  was  able  to  determine  the 
legal  range  of  orientations  that  ensure  no  collisions  as  the  object  is  swept  down 
the  freeway.  Brooks’  algorithm  is  highly  efficient,  and  works  well  in  relatively 
uncluttered  space,  but  it  occasionally  fails  to  find  a  safe  path  when  it  is  necessary 
to  maneuver  in  tight  spaces.  Recently,  Donald  [1983]  has  proposed  a  novel  channel 
based  technique. 
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Figure  10.  A  pal  I.  Found  In  ilu-  Brooks  ami  Lozano- Perez  program.  (I{eproduro<i  from 
llrook.s  and  l.oznrio-lYrez  I  OSH,  figure  1  la!) 


Finally,  Brooks  [lOS.'ib]  has  developed  an  algorithm  that  combines  the  C-space 
and  freeway  approaches  to  find  paths  for  pick  and  place  and  insertion  tasks  for 
a  PUMA,  Pick  and  place  tasks  are  defined  as  four  degree  of  freedom  tasks  in 
which  the  only  reorientations  permitted  are  about  the  vertical,  and  in  which  the 
found  path  is  composed  of  horizontal  and  vertical  straight  lines.  Figure  12  shows 
an  example  path  found  by  Brooks’  algorithm. 

Brooks  freezes  joint  -1  of  the  PUMA.  The  algorithm  subdivides  free  space  to 
find  (i)  freeways  for  the  hand  and  payload  assembly,  (ii)  freeways  for  the  upper 
arm  subassemblv  (joints  i  and  2  of  the  PI  \1A);  (iii)  searches  for  the  payload  and 


Figure  12.  An  example  of  path  finding  for  a  PUMA  by  brooks’  il9K3bj  algorithm. 

upper  arm  freeways  concurrently  under  the  projection  of  constraints  determined  by 
the  forearm.  The  subdivision  of  free  space  in  this  way  is  the  most  notable  feature 
of  Brooks’  appoach.  It  stands  in  elegant  relation  to  the  algorithms  for  computing 
inverse  kinematics  referred  to  earlier.  It  is  assumed  that  the  payload  is  convex,  and 
that  the  obstacles  are  convex  stalagmites  and  stalactites.  It  is  further  assumed  that 
stalactites  arc  in  the  workspace  of  the  upper  arm  of  the  PUMA,  not  of  the  payload. 

By  restricting  attention  to  a  limited  class  of  tasks,  Brooks  has  designed  an 
efficient  algorithm  will  not  work  in  all  cases.  The  advantage  is  that  he  does  not 
have  to  contend  with  worst  case  situations  that  lead  to  horrendous  polynomial 
complexity  estimates,  for  example,  Schwartz  and  Sharir  [1983]  suggest  a  method 
whose  complexity  for  r  DOF  is  n2  .  For  example,  for  two  moving  objects,  the 
complexity  is  n'10<J6.  Their  algorithm  is  not  implemented.  Donald  [IDS'!]  presents 
the  first  implemented,  representation-complete,  search-complete  algorithm  (at  a 
given  resolution)  for  the  classical  Movers’  problem  for  Cartesian  manipulators. 

There  has  been  a  great  deal  of  theoretical  interest  in  the  find  path  problem  by 
researchers  in  computational  complexity  and  computational  geometry.  Schwartz 
and  Sharir  [1983],  and  Ilopcroft  ,  Schwartz,  and  Sharir  [1983]  are  representative. 
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i-  an  important  example  ul  the  use  ol  lone  t  ra  jci  t  or ie>  ’  o  ,u  compliant 

assembly.  Another  example  i-  scribing  a  straight  line  on  an  n'ni..,at  ng  mii  lace.  In 
that  case,  it  is  neccv-a  r  v  to  control  pm  it  ion  in  the  tangent  pta  :.i  .  :  :,<  *-a  r!  ace.  and 

maintain  (  ontm  t  with  the  --nr  face  bv  ap|»l>  in  ft  an  approprta’c  -ct  din;;-  lorce  normal 
to  the  surface  Ol  lier  examples  ol  compliance  include  cutt:m.\  s'rctt  insertion.  and 
bayonet  style  fixtures,  such  as  camera  mountings.  Olocksin  <  in'.  I9vj  describe 
a  seam  welding  robot  that  uses  the  dillerenee-ol  Oaussians  edge  opt  ralor  proposed 
by  Marr  and  Hildreth  1980  to  determine  the  welding  trajectory  Oliwovoriolc  and 
Roth  19, SI;  show  how  the  motions  possible  at  an  assembly  step  can  be  partitioned 
into  3  classes:  those  that  tend  to  separate  the  bodies  to  be  mated:  those  that  tend 
to  make  one  body  penetrate  the  other;  and  those  that  move  the  body  arid  maintain 
the  original  constraints.  Theoretically,  this  provides  a  basis  for  choosing  tire  next 
step  in  an  assembly  sequence. 

Trevelyan.  Kovosi.  and  Ong  [198-lj  describe  a  sheep  shearing  robot.  Figure 
13  shows  the  geometry  of  the  robot  and  the  “workpiece’’.  Trevelyan,  Kovosi,  and 
Ong  ;19ST  note  that  "over  two  hundred  sheep  have  been  shorn  by  the  machine 
(though  not  completely)  yet  only  a  few  cuts  have  occurred.  This  extremely  low 
injury  rate'  results  from  the  use  of  sensors  mounted  on  the  shearing  cutter  wdiich 
allow  the  computer  controlling  the  robot  to  keep  the  cutter  moving  just  above 
the  sheep  skin’’.  Trajectories  are  planned  from  a  geometric:  model  of  a  sheep  using 
Bcrnstein-Hezicr  parametric  curves.  The  trajectory  is  modified  to  comply  with 
sense  data.  Two  capacitance  sensors  are  mounted  under  the  cutter  just  behind  the 
comb.  These  sensors  can  detect  the  distance  between  the  cutter  and  the  sheep’s 
skin  to  a  range  of  approximately  30mm.  Compliance  is  needed  to  take  account  of 
inaccuracies  in  the  geometric  model  of  the  sheep  and  the  change  in  shape  of  the 
sheep  as  it  breathes. 

Robots  evolved  for  positional  accuracy,  and  are  designed  to  be  mechanically 
stiff.  High  tolerance  assembly  tasks  typically  involve  clearances  of  the  order  of  a 
thousandth  of  an  inch.  In  view  of  inaccurate  modeling  of  the  world  and  limitations 
on  joint  accuracy,  low  stiffnesses  arc  required  to  effect  assemblies.  Devices  such  as 
the  Remote  Center  Compliance  (RCC)  [Whitney  1983J  and  the  Ili-T  hand  [Colo 
1980)  exploit  the  inherent  mechanical  compliance  of  springs  to  accomplish  tasks. 
Such  passive  compliant  devices  are  fast,  but,  the  specific  application  is  built  into  the 
mechanical  design.  The  challenge  is  that  different  tasks  impose  different  stiffness 
requirements,  in  active  compliance,  a  computer  program  modifies  the  trajectory  of 
the  arm  on  the  basis  of  sensed  forces  (and  other  modalities)  [Paul  and  SHmano 

‘Trevelyan  informs  the  author  that  a  tinman  sheep  shearer  type  ally  ruts  a  sheep  ,  ,'!()  anes, 
and  that  serums  cuts  occur  regularly . 


1976].  Active  compliance  is  a  general  purpose  technique;  but  is  typically  slow 
compared  to  passive  compliance. 

Mason  [1981]  suggested  that  the  (fixed  number  of)  available  degrees  of  freedom 
of  a  task  could  be  divided  into  two  subsets,  spanning  orthogonal  subspaccs.  The 
subspaces  correspond  one-one  with  the  natural  constraints  determined  by  the 
physics  of  the  task,  and  the  artificial  constraints  determined  by  the  particular 
task.  Sec  [Mason  19S3]  for  details  and  examples.  For  example,  in  screwdriving, 
a  screwdriver  cannot  penetrate  the  screw,  giving  a  natural  constraint;  successful 
screwdriving  requires  that  the  screwdriver  blade  be  kept  in  the  screw  slot,  an 
artificial  constraint.  Raibcrt  and  Craig  [1983]  refined  and  implemented  Mason’s 
model  as  a  hybrid  force-position  controller.  Raibert  and  Craig’s  work  embodies  the 
extremes  of  stiffness  control  in  that  the  programmer  chooses  which  axes  should 
be  controlled  with  infinite  stiffness  (using  position  control  with  an  integral  term) 
and  which  should  be  controlled  with  zero  stiffness  (to  which  a  bias  force  is  added). 
Salisbury  [1980]  suggests  an  intermediate  ground  that  he  calls  “active  stiffness 
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l.o/. mo- 1  ’ore/,  Mason.  and  Tavlor  I9vl  have 
rally  synthesizing  compliant  motions  from  geometric  de-crip;  tons  ol  a  task,  rite 
approach  combines  Mason's  ideas  about  compliance,  l.o/ano- Perez's  ('-space,  and 
Taylor  s  197b  proposal  for  programming  robots  by  lleshirtg  out  skeletons  forming 
a  library  of  operations.  The  approach,  currently  being  implemented,  deals  head-on 
with  errors  in  assumed  position  and  heading.  l.ozano-IVrez,  Mason,  and  Taj' lor  use 
a  generalized  damper  model  to  determine  all  the  possible  configurations  that  can 
result  from  a  motion.  It  is  necessary  to  avoid  bring  jammed  in  the  friction  cone 
of  any  of  the  surfaces  ct;  route  to  the  goal  surface.  This  sets  up  a  constraint  for 
each  surface.  Intersecting  the  constraints  leaves  a  range  of  possible  {sequences  of) 
compliant  motes  that  are  guaranteed  to  achieve  the  goal,  notwithstanding  errors. 

Programming  by  ilc-h.ng  out  skeletons  is  reminiscent  of  the  programmer's 
nppri  rJirt.  Rich  and  Waters  19X1!.  'fhc  similarities  are  that  the  computer  adopts 
the  role  of  junior  partner  or  critic,  programming  is  based  on  cliches,  and  design 
decisions  and  logical  dept  ndenries  are  explicitly  represented  so  that  the  effects  of 
modtfica'  tons  to  a  program  can  be  automatically  propogatod  through  the  program. 
The  difference  is  that  a  robot  programmer's  apprentice  works  with  rich  geometric 
models.  I.ozano-Perez  has  suggested  that  guiding  can  be  extended  to  teach  a  robot 
plans  that  involve  sensing,  a  large  number  of  similar  movements  (for  example 
unloading  a  palette),  ami  asynchronous  control  of  multiple  manipulators.  The 
requirement  that  a  system  deal  with  rich  geometric  models  also  distinguishes  the 
robot  programmer  s  apprentice  from  earlier  work  in  AI  planning  'Sacerdoti  1975], 


6.  Perception 


6.1.  Introduction 


The  perceptual  abilities  of  commercially  available  robots  arc  severely  limited, 
rsperially  when  compared  with  laboratory  systems.  It  is  convenient  to  distinguish 
cont.ii  t  and  rion-<ontart  sen-ing.  Contact,  or  local,  sensing  includes  tactile, 
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Rohol  percept  ion  is  only  a  special  ease  ol  roripuh  r  perception  in  (lie  sense  that 
there  are  occasional  opportunities  lor  engineering  solutions  to  what  ate.  in  general, 
difficult  prolih-ms.  Examples  include:  arrangin'!,  the  lighting,  controlling,  positional 
uncertainty,  finessing  some  of  the  issues  in  depth  computation,  and  limiting  the 
visual  context  ol  a ri  object.  Appropriate  lighting  can  avoid  shadows,  light  striping 
and  laser  range  finding  can  produce  partial  depth  maps,  and  techniques  such  as 
photometric  stereo  [Wood ham  MINI]  can  exploit  control  over  lighting.  On  the  other 
hand,  edge  finding  is  no  less  difficult  in  industrial  images,  texture  is  just  as  hard, 
and  the  bin  of  parts  is  a  tough  nut  for  stereo.  Motion  tracking  on  a  dirty  conveyor 
belt  is  as  hard  as  any  other  tracking  problem.  Representing  the  shape  of  complex 
geometric  parts  is  as  difficult  as  any  representational  problem  in  computer  vision 
jl'augeras  et.  al.  1983,  1981,  Brady  and  Asada  1981,  Holies  ct.  al.  198lj.  Existing 
commercial  robot  vision  systems  carry  out  simple  inspection  and  parts  acquisition. 
There  are.  however,  many  inspection,  acquisition,  and  handling  tasks,  routinely 
performed  bv  humans,  that  exceed  the  abilities  of  current  computer  vision  and 
tactile  sensing  research. 

The  quality  of  sensors  is  increasing  rapidly,  especially  as  designs  incorporate 
VLSI.  The  interpretation  of  sensory  data,  especially  vision,  has  significantly 
improved  over  the  past  decade.  Sensory  data  interpretation  is  computer  intensive, 
requiring  billions  of  cycles.  However,  much  of  the  computer  intensive  early  processing 
naturally  calls  for  local  parallel  processing,  and  is  well  suited  to  implementation  on 
special  purpose  VLSI  hardware  [Brady  1983a,  Raibert  and  Tanner  1982], 

6.2.  Contact  sensing 

Contact  sensing  is  preferred  when  a  robot  is  about  to  be,  or  is,  in  contact 
with  some  object  or  surface.  In  such  cases,  objects  are  often  occluded,  even  when 
a  non-contact  sensor  is  mounted  on  a  hand.  An  exception  to  this  is  -.cam  welding 
[Clocksin  et.  al.  1982).  The  main  motivation  for  force  sensing  is  not,  however,  to 
overcome  occlusion,  but  to  achieve  compliant  assembly.  Force  sensors  have  improved 
considerably  over  the  past  two  or  three  years.  Typical  sensitivities  range  from  a 
half  ounce  to  ten  pounds.  Most  work  on  force  trajectories  has  been  application 
specific  (eg  peg-in-hole  insertion).  Current  research,  aimed  at  developing  general 
techniques  for  interpreting  force  data  and  synthesizing  compliant  programs,  were 
discussed  in  the  previous  section.  Kanadc  and  Sommer  [1981]  and  Okada  [  1 982] 
report  proximity  sensors. 

Touch  sensing  is  currently  the  subject  of  intensive  research.  Manufacturing 
engineers  consider  tactile  sensing  to  be  of  vital  importance  in  automating  assembly 
[Harrnon  1982,  1984].  Unfortunately,  current  tactile  sensors  leave  much  to  be 
desired.  They  are  prone  to  wear  and  tear,  have  poor  hysteresis,  and  low  dynamic 
range.  Industrially  available  tactile  sensors  typically  have  a  spatial  resolution  of 
only  about  8  points  per  inch.  Tactile  sensors  are  as  poor  as  TV  cameras  were  in 
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Figure  H.  S;i ni [>?<•  tactile  images  from  (lie  llillis  sensor.  (Reproduced  from  (llillis  10S2, 
i’igiire  (i'j) 

the  1960s,  the  analogy  being  that  they  are  seriously  hampering  the  development  of 
tactile  interpretation  algorithms. 

Several  laboratory  demonstrations  point  the  way  to  future  sensors.  llillis  [1982] 
devised  a  tactile  sensor  consisting  of  an  anisotropic  silicon  conducting  material 
whose  lines  of  conduction  were  orthogonal  to  the  wires  of  a  printed  circuit  board 
and  which  were  separated  by  a  thin  spacer.  Figure  M  shows  some  example  touch 
images  generated  by  llillis'  tactile  sensor  for  four  small  fasteners.  The  sensor  had 
a  spatial  resolution  of  256  points  per  square  centimeter.  Raibcrt  and  Tanner  [1982] 
developed  a  VLSI  tactile  sensor  that  incorporated  edge  detection  processing  on 
the  chip  (Figure  15).  'This  (potentially)  significantly  reduces  the  bandwidth  of 
communication  between  the  sensor  and  the  host  computer.  Recently,  Haekwood 
and  Beni  [19.83]  hav-  developed  a  tactile  sensor  using  magneto-restrictive  materials 
that  appears  to  be  able  to  compute  shear  forces. 

Little  progress  has  been  made  in  the  development  of  tactile  object  recognition 
algorithms.  llillis  built  a  simple  pattern  recognition  program  that  could  recognize 
a  variety  of  fasteners.  Gaston  and  Lozano- Perez  (19831  have  built  a  program 
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Figure  15.  Architecture  of  the  V  I, SI  tactile  sensor  developed  by  Kaibert  and  Tanner.  A 
layer  of  pressure-sensitive  rubber  is  placed  in  contact  with  a  VLSI  wafer.  Metalization  on  the 
surface  of  the  wafer  forms  large  sensing  electrodes  that  make  contact  with  the  pressure-sensitive 
rubber  through  holes  in  it  protective  layer  of  SiO-t,  the  overglass.  (l{eproduced  from  [llaibert  and 
Tanner  lf)82,  Figure  Ij) 

that  constructs  an  interpretation  tree  for  a  class  of  two-dimensional  objects.  The 
program  assumes  that  there  are  n  discrete  sensors,  at  each  of  which  the  position 
and  an  approximate  measure  of  the  surface  orientation  is  known.  They  show  how 
two  constraints,  a  distance  constraint  and  a  constraint  on  the  normal  directions 
at  successive  touch  points,  can  substantially  cut  down  the  number  of  possible 
grasped  object  configurations.  Grimson  and  Lozano-Pcrcz  [1983]  have  extended  the 
analysis  to  three  dimensions.  Faugeras  and  Hebert  [1983]  have  developed  a  similar 
three-dimensional  recognition  and  positioning  algorithm  using  geometric  matching 
between  primitive  surfaces.  Bajcsy  [1984]  has  investigated  the  use  of  two  tactile 
sensors  to  determine  the  hardness  and  texture  of  surfaces. 

6.3.  Non-contact  sensing 

Non-contact  sensing  is  important  for  a  variety  of  applications  in  manufacturing. 
These  include: 

•  Inspection.  Most  current  industrial  inspection  uses  binary  two-dimensional 
images.  Only  recently  have  grey  level  systems  become  commercially  available. 
No  commercial  system  currently  offers  an  modern  edge  detection  system.  Two 
dimensional  inspection  is  appropriate  for  stamped  or  rotationally  symmetric  parts. 
Some  experimental  prototypes  [Porter  and  Mundy  1982,  Faugeras  et.  al.  1983] 
inspect  surfaces  such  as  engine  mountings  and  airfoil  blades. 

•  parts  acquisition.  Parts  may  be  acquired  from  conveyor  belts,  from  palettes, 
or  from  bins.  Non-contact  sensing  means  that  the  position  of  parts  may  not  be 
accurately  specified.  Parts  may  have  to  be  sorted  if  there  is  a  possibility  of  more 
than  one  typo  being  present. 
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Active  sensing;  tins  been  developed  m.iinlv  lor  military  .qiplic.it  inn--  Image 
iinder-l anding  is  d i f lie  11  It  ;nul  rc(|mrcs  a  great  dial  ol  computer  power.  I  l.lli.  S A H . 
and  millimeter  radar  miaeerv  oiler  limited,  i  ompiilat  ionalK  expedient,  solutions  to 
dillicnlt  vision  problems.  The  algorithms  that  have  been  de\<  loped  lor  i--ol.it  mg  and 
ident living  targets  in  natural  -reties  art'  restricted  in  scope.  They  do  not  generalize 
easily  to  manufacturing  settings,  where,  for  example,  most  object-  are  hot  . 

\  ision  has  the  most  highlv  developed  theory  ,  and  the  best  sensors  Now  one  can 
get  high  qualitv  solid  state  cameras.  The  rapid  increase  in  the  quality  ol  solid  state 
cameras  has  been  accompanied  by  the  rapid  development  of  image  understanding 
techniques. 

Marly  vision  processes  include  edge  and  region  finding,  texture  analysis,  and 
motion  computation.  All  these  operat ions  are  well  suited  to  local  parallel  processing. 
Developments  in  edge  finding  include  the  work  of  Ylarr  and  Hildreth  [198(1],  Haralick 
[1982],  and  Canny  [1983].  Developments  in  grouping  include  the  work  of  Howe  and 
Binford  [1983].  Hildreth  [1983]  has  developed  a  system  for  computing  directional 
selectivity  of  motion  using  the  Nlarr-Ilildreth  edge  finder.  Horn  and  Schunck  [1981] 
and  Schunck  [1983]  have  shown  how  to  compute  the  optic  flow  field  from  brightness 
patterns.  (Bruss  and  Horn  [1983]  have  developed  an  analysis  of  how  the  flow  field 
can  be  used  in  passive  navigation.)  Brady  [ 1 983b .  Brady  1983d,  Brady  and  Asada 
1984]  has  developed  a  new  technique  for  representing  two-dimensional  shape,  and 
has  applied  it  to  inspection. 

The  major  breakthrough  in  vision  over  the  past  decade  has  been  the  development 
of  three  dimensional  vision  systems.  These  are  usually  referred  to  as  “shape  from” 
processes.  Examples  include:  shape  from  stereo  [Crimson  1981,  Baker  and  Binford 

1981,  Binford  1984,  Nishihara  and  Poggio  1984],  shape  from  shading  [Ikeuchi  and 
Horn  1983],  shape  from  contour  [Witkin  1981,  Brady  and  Yuille  1983],  and  shape 
from  structured  light  [Porter  and  Mundy  1982,  Faugeras  1983,  Clocksin  et.  al. 
1983,  Bolles,  Horaud,  and  Hannah  1984,  Tsuji  1984]. 

Most  of  these  “shape  from”  processes  produce  partial  depth  maps.  Recently, 
fast  techniques  for  interpolating  full  depth  maps  have  been  developed  [Terzopoulos 

1982,  1983].  A  current  area  of  intense  investigation  is  the  representation  of  surfaces 
Faugeras,  Hebert,  and  Ponce  1984,  Shirai  et.  al.  1984,  Ikeuchi  and  Horn  1983, 
Brady  and  Yuille  1984].  f  inally,  recent  work  by  Brooks  [1981]  discusses  object 
representation  and  the  interaction  between  knowledge  guided  and  data  driven 
processing. 

7.  Reasoningthat  connectsperceptionto  action 

This  final  section  is  speculative.  It  presents  three  examples  of  reasoning  and 
problem  solving  that  we  are  striving  to  make  robots  capable  of.  The  aim  is  to 
illustrate  the  kinds  of  tilings  we  would  like  a  robot  to  know,  and  the  way  in  which 
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Figure  16.  What  is  this  tool,  and  what  is  it  for? 

that  knowledge  might  be  used.  The  knowledge  that  is  involved  concerns  geometry, 
forces,  process,  space,  and  shape.  The  examples  involve  tools.  They  concern  the 
interplay  between  the  use  or  recognition  of  a  tool  and  constraints  on  the  use  of 
tools.  Reasoning  between  structure  and  function  is  particularly  direct  in  the  case 
of  tools.  Shape  variations,  though  large  (there  are  tens  of  kinds  of  hammer),  are 
lessened  by  the  fact  that  tools  are  rarely  fussily  adorned,  since  such  adornments 
would  get  in  the  way  of  using  the  tool. 

7.1.  What  is  that  tool  for? 

What  is  the  tool  illustrated  in  Figure  16,  and  how  is  it  to  be  used?  We 
(reasonably)  suppose  that  a  vision  program  [Brady  and  Asada  1984,  Brady  1984] 
computes  a  description  of  the  object  that,  based  on  the  smoothed  local  symmetry 
axis,  partially  matches  a  crank.  The  model  for  a  crank  indicates  that  it  is  used  by 
fixing  the  end  P  onto  some  object  0,  and  rotating  the  object  0  about  the  symmetry 
axis  at  P  by  grasping  the  crank  at  the  other  end  Q  and  rotating  in  a  circle  whose 
radius  is  the  length  of  the  horizontal  arm  of  the  crank.  Further  investigation  of  the 
crank  model  tells  us  that  it  is  used  for  increasing  the  moment  arm  and  hence  the 
torque  applied  to  the  object  0 .  We  surmise  that  the  tool  is  to  be  used  for  increasing 
torque  on  an  object  0.  We  have  now  decided  (almost)  how  the  tool  is  to  be  used, 
and  we  have  a  hypothesis  about  its  purpose.  The  hypothesis  is  wrong. 

The  one  thing  that  we  do  not  yet  know  about  how  to  use  the  tool  is  how  to  fix 
it  at  P  to  the  object  at  0 .  There  are  many  possibilities,  the  default  being  perhaps 
a  socket  connector  for  a  nut  (as  for  example  on  a  tire  lever).  Closer  inspection 
of  the  description  computed  by  our  vision  program  shows  that  the  ends  of  the 
crank  are  screwdriver  blades,  set  orthogonal  to  each  other.  Only  screwdrivers 
(in  our  experience)  have  such  blades.  Apart  from  the  blade,  the  tool  bears  some 
resemblance  to  a  standard  screwdriver,  which  also  has  a  handle  and  a  shaft.  In 
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the  standard  screwdriver,  however,  the  axes  of  the  shaft  and  handle  are  eollinear. 
Kvidently,  the  tool  is  a  special  purpose  screwdriver,  since  only  screwdrivers  have 
such  blades. 

Tools  have  the  shape  that  they  do  in  order  to  solve  some  problem  that  is  difficult 
or  impossible  to  solve  with  more  generally  useful  forms.  So  why  the  crank  shape? 
What  problem  is  being  solved  that  could  not  be  solved  with  a  more  conventional 
screwdriver?  Here  are  some  screwdriver-specific  instances  of  general  problems  that 
arise  using  tools: 

•  Parts  interface  bug.  A  part  docs  not  match  the  part  to  which  it  is  being 
applied  or  fastened.  For  example,  a  wrench  might  be  too  small  to  span  a  nut;  a 
sledgehammer  is  inappropriate  for  driving  a  tack.  The  screwdriver  head  may  not 
match  the  screw  (one  might  be  Philips  type).  There  is  no  evidence  for  this  bug  in 
Figure  16  because  the  fastener  is  not  shown. 

•  Restricted  rotary  motion  bug.  A  tool  that  is  operated  by  turning  it  about  some 
axis  has  encountered  an  obstruction  that  prevents  it  turning  further.  This  bug 
occurs  frequently  in  using  wrenches.  A  socket  wrench  solves  it  by  engaging  a  gear 
to  turn  the  wrench  in  one  direction,  disengaging  the  gear  to  rotate  in  the  other. 
How  is  it  solved  more  generally?  There  is  an  analogous  restricted  linear  motion  bug. 
Think  of  an  example!  (one  is  given  at  the  end  of  the  section.) 

•  Restricted  access  bug.  As  anyone  owning  a  particular  (expensive)  kind  of  British 
car  knows  only  too  well,  often  the  hardest  part  of  using  a  tool  is  mating  it  to 
the  corresponding  part.  Many  tools  have  an  axis  along,  or  about,  which  they  are 
applied.  The  most  common  version  of  the  restricted  access  bug  is  when  the  axis  is 
too  long  to  fit  into  the  available  space.  In  the  case  of  screwdriving,  this  occurs  when 
the  screwdriver  is  restricted  vertically  above  the  screw.  A  short,  stubby  screwdriver 
is  the  usual  solution  to  this  problem. 

Can  the  crank-screwdriver  also  overcome  restricted  access  bugs?  Of  course. 
The  geometric  form  of  the  crank- screwdriver  is  necessary  to  solve  this  restricted 
workspace  problem,  rather  than  being  a  torque  magnifier  as  initially  hypothesized. 
In  fact,  the  tool  is  called  an  offset  screwdriver.  Figure  17  illustrates  its  use. 

Since  I  first  presented  this  example,  another  solution  to  the  restricted  access 
bug  has  been  brought  to  my  attention.  Figure  18  shows  a  screwdriver  whose  shaft 
can  bend  about  any  axis  orthogonal  to  it. 

Why  are  the  blades  of  an  offset  screwdriver  set  orthogonal  to  one  another? 
Put  differently,  what  bug  do  they  help  overcome?  What  would  you  need  to  know 
in  order  to  figure  it  out? 

No  program  is  currently  capable  of  the  reasoning  sketched  above.  Pieces  of 
the  required  technology  arc  available,  admittedly  in  preliminary  form,  and  there  is 
cause  for  optimism  that  they  could  be  made  to  work  together  appropriately.  First, 
vision  programs  exist  that  can  almost  generate  the  necessary  shape  descriptions 
and  model  matching  [Brady  and  Asada  1981,  Brady  1981).  There  is  considerable 
interplay  between  form  and  function  in  the  reasoning,  and  this  has  been  initially 
explored  by  Winston,  Binford,  and  their  colleagues,  combining  the  ACRONYM 
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system  of  shape  description  and  Winston’s  analogy  program  [Winston,  Binford, 
Katz,  and  Lowry  1982].  To  figure  out  that  the  crucial  thing  about  the  form  is  its 
ability  to  overcome  a  restriction  in  the  workspace,  it  is  necessary  to  be  able  to 
reason  about  space  and  the  swept  volumes  of  objects.  This  is  the  contribution  of 
Lozano-Perez  [1981,  1983],  Brooks  [1983b],  and  Lozano-Perez,  Mason,  and  Taylor 
[1983].  Forbus  [1983]  is  developing  a  theory  of  processes,  a  system  that  can  reason 
about  physical  processes  like  water  flow,  heat,  and  springs.  This  builds  upon  earlier 
work  by  DeKleer  [1975]  and  Bundy  [1979]. 

Answer  to  the  problem:  An  example  of  a  restricted  linear  motion  bug :  Trying  to 
strike  a  nail  with  a  hammer  when  there  is  insufficient  space  to  swing  the  hammer. 

7.2.  Why  are  wrenches  asymetric? 

Figure  19a  shows  a  standard  (open-jawed)  wrench.  Why  is  it  asymmetric?  To 
understand  this  question,  it  is  necessary  to  understand  how  it  would  most  likely  be 
judged  asymmetric.  This  involves  finding  the  head  and  handle  [Brady  and  Asada 
1984],  assigning  a  “natural”  coordinate  frame  to  each  [Brady  1982,  1984],  and 
realizing  that  they  do  not  line  up.  Since  the  handle  is  significantly  longer  that  the 
head,  it  establishes  a  frame  for  the  whole  shape,  so  it  is  the  head  that  is  judged 
asymmetric  about  the  handle  frame. 

Now  that  we  at  least  understand  the  question,  can  we  answer  it?  We  are 
encouraged  to  relate  a  question  of  form  to  one  of  function.  What  is  a  wrench  for, 
and  how  is  it  used?  It  is  used  as  shown  in  Figure  19b:  the  head  is  placed  against  a 
nut;  the  handle  is  grasped  and  moved  normal  to  its  length;  if  the  diameter  of  the 
nut  and  the  opening  of  the  jaws  of  the  wrench  match,  the  nut  (assumed  fixed)  will 
cause  the  handle  to  rotate  about  the  nut.  Nowhere  is  mention  made  of  asymmetry. 
Surely,  a  symmetric  wrench  would  be  easier  to  manufacture.  Surely,  a  symmetric 
wrench  would  be  equally  good  at  turning  nuts.  Or  would  it? 
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problem  that  a  "standard  .  here  symmetric.  wrench  < ould  not  solve  The 
problem  that  ari'is  using  a  wrench  is  the  rest  ruled  rotary  motion  buy,.  In 
a  tasks  there  is  an  interval  j  O  \ ,  O  j  (measured  from  the  lot  a  I  coordinate  frame 
^  /responding  to  the  axis  ol  the  handle)  through  whit  h  the  wrench  t  an  he  rotated. 
;■  /he  erueial  ohservation  is  that  a  wrench  is  (effectively)  a  lamina.  As  such,  it  has  a 
degree  ol  Freedom  corresponding  to  flipping  it  over.  Kxploit ing  t his  degree  ol  freedom 
makes  no  ihfft  rt  nct  to  the  elleet  ive  workspace  ol  a  symmetric  wrench.  It  doubles 
the  workspace  of  an  asymmetric  wrench,  giving  [-  £>_>.  -  Q\ ]  U'Oi ,  O-j j .  In  this  way, 
an  asymmetric  wrench  partially  solves  the  restricted  rotary  motion  bug.  Perhaps 
this  suggests  how  to  dtsujn  the  head  of  a  wrench,  say  by  minimizing  j  —  0|,<pi] 
subject  to  keeping  the  jaws  parallel  to  each  other.  Perhaps  it  also  suggests  (for 
example  to  Winston's  (lOKOj  analogy  program)  that  other  turning  tools  should  be 
asymmetric,  for  analogous  reasons.  There  arc  many  examples,  of  course,  including 
the  offset  screw'driver  discussed  in  the  previous  section. 

7.3.  How  to  disconnect  a  battery 

The  Gnal  example  in  this  section  concerns  a  familiar  AI  technique:  debugging 
“almost  right”  plans.  A  battery  is  to  be  disconnected.  The  geometry  of  the  terminal 
is  shown  in  Figure  20a.  A  plan,  devised  previously,  uses  two  socket  w'renches,  one 
acting  as  an  adaptable  fixture,  the  other  to  turn  the  bolt.  The  socket  wrenches  are 
applied  along  their  axes,  which  coincide  with  the  axes  of  the  nut  and  bolt. 

A  new  model  of  the  battery-laden  device  is  delivered.  The  plan  will  no  longer 
work  because  there  is  an  obstacle  to  the  left  of  the  head  of  the  nut,  restricting 
travel  of  the  socket  wrench  along  its  axis.  Using  something  akin  to  dependency 
directed  reasoning,  truth  maintenance,  or  a  similar  technique  for  recording  the 
causal  connections  in  a  chain  of  reasoning,  we  seek  a  method  for  adapting  our 
alrnost-right  plan  to  the  new  circumstance.  There  arc  a  variety  of  techniques,  one 
of  which  was  illustrated  in  the  offset  screwdriver.  That  suggests  bending  a  tool  so 
that  torque  can  be  applied  by  pushing  on  a  lever.  In  fact,  socket  wrenches  have 
this  feature  built-in,  so  the  problem  was  easy. 

Unfortunately,  the  fix  did  not  work.  Figure  20b  shows  why.  The  new  model 
has  insufficient  clearance  for  a  socket  wrench  to  fit  around  the  head  of  the  bolt. 
We  further  reconsider  the  plan,  adding  the  new  constraints,  removing  those  parts 
that  were  dependent  upon  being  able  to  use  the  socket  wrench  (none,  in  this  case). 
The  next  idea  is  to  use  a  different  kind  of  wrench,  but  this  will  not  work  either, 
again  because  of  the  insufficient  clearance.  We  need  to  be  able  to  grasp  one  of  the 
nut  and  bolt  securely  and  turn  the  other.  We  can  hold  the  bolt  and  turn  the  nut. 
Several  ways  are  available,  most  obviously  using  a  needicnose  plicr  to  secure  the 
bolt. 

8.  Conclusion 

Since  Robotics  is  the  connection  of  perception  to  action,  Artificial  Intelligence 
must  have  a  central  role  in  Robotics  if  the  connection  is  to  be  intelligent.  We  have 
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Figure  20.  a.  Tli  r  geometry  of  a  battery  terminal,  b.  side  view  of  the  late  model  bolt. 

illustrated  current  interactions  between  AI  and  Robotics  and  presented  examples  of 
the  kinds  of  things  we  believe  it  is  important  for  robots  to  know.  We  have  discussed 
what  robots  should  know,  how  that  knowledge  should  be  represented,  and  how  it 
should  be  used.  Robotics  challenges  AI  by  forcing  it  to  deal  with  real  objects  in 
the  real  world.  An  important  part  of  the  challenge  is  dealing  with  rich  geometric 
models. 
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